Blog Classification Using Tags: An Empirical Study
نویسندگان
چکیده
With an exponential growth of Weblogs (or blogs), many blog directories have appeared to help users to locate topical blogs. As tags are commonly used to describe blogs, we study the effectiveness of tags in blog classification. Compared with titles and descriptions, our experiments, using 24,247 blogs, showed that tags could lead to better classification accuracy. It is interesting to observe that more tags did not necessarily lead to better classification accuracy. To better describe blogs, we have also proposed a tag expansion algorithm that assigns a blog more tags that are often co-occur with those already associated with the blog. Our experiments showed that tag expansion helped to improve the recall of blog classification with the price of precision degradation.
منابع مشابه
Using Tags and Clustering to Identify Topic-Relevant Blogs
The Web has experienced an exponential growth in the use of weblogs or blogs. Blog entries are generally organised using tags, informally defined labels which are increasingly being proposed as a ‘grassroots’ answer to Semantic Web standards. Despite this, tags have been shown to be weak at partitioning blog data. In this paper, we demonstrate how tags provide useful, discriminating information...
متن کاملClassifying Blog Posts with Tag Propagation
Blog tags are usually considered to be supplementary information for blog post classification tasks. Due to the sparsity of tag features, improving performance of classifiers merely using tags is not a trivial operation. This paper presents a blog post classification approach based on the tag propagation strategy. Using a dataset of blog posts gleaned from the Internet, tags of a blog post are ...
متن کاملTags are not metadata, but "just more content" - to some people
The authoring of tags – unlike the authoring of traditional metadata – is highly popular among users. This harbours unprecedented opportunities for organizing content. However, tags are still poorly understood. What do they “mean”, in what senses are they similar to or different from metadata? Different tags support different communities, but how exactly do they reflect the plurality of opinion...
متن کاملAn Improved Approach for Topic Ontology Based Categorization of Blogs Using Support Vector Machine
Problem statement: Information search, collection and categorization from the blogosphere are still one of the important issues to be resolved. Mainly, the blogs assist the variety of interesting and useful information. Because of its increasing growth, blogs can not be categorized effectively. Therefore it is difficult to find relevant topics from the blogs. Hence blogs need to be categorized ...
متن کاملEvaluating tag filtering techniques for web resource classification in folksonomies
Social or collaborative tagging systems emerged as a novel classification scheme on the Web based on the collective knowledge of people. In sites such as Del.icio.us, Technorati or Flickr, users annotate a variety of resources, including Web pages, blogs, pictures, videos or bibliographic references; using freely chosen textual labels or tags. Underlying collaborative tagging systems are ternar...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007